CDS

Accession Number TCMCG019C08678
gbkey CDS
Protein Id XP_022938301.1
Location complement(join(1859058..1859174,1859698..1859790,1859900..1860020,1860714..1860856,1860954..1861035,1861610..1861821,1862015..1862142,1862215..1862382,1862476..1862592,1862773..1862869,1862988..1863002))
Gene LOC111444435
GeneID 111444435
Organism Cucurbita moschata
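
The Location field above uses the INSDC feature-location syntax: `join(...)` lists the exon segments and `complement(...)` places the feature on the reverse strand. As a rough consistency check, the sketch below sums the segment lengths and compares the total with the 430 aa protein length listed in this record plus one stop codon. The function names `parse_location` and `cds_length` are illustrative only and do not come from any library; the regex assumes plain `start..end` segments with no partial-location markers such as `<` or `>`.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(1859058..1859174,1859698..1859790,1859900..1860020,"
    "1860714..1860856,1860954..1861035,1861610..1861821,1862015..1862142,"
    "1862215..1862382,1862476..1862592,1862773..1862869,1862988..1863002))"
)

def parse_location(location):
    """Return (strand, segments) for a simple join/complement location.

    Hypothetical helper: handles only plain 'start..end' segments.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(start), int(end))
                for start, end in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

def cds_length(segments):
    """Sum segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)

strand, segments = parse_location(LOCATION)
total_nt = cds_length(segments)
print(strand, len(segments), total_nt)
```

For this record the segments sum to 1293 nt, which equals 3 x (430 + 1), so the coordinates agree with the stated protein length.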

Protein

Length 430aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023082533.1
Definition putative protease Do-like 14 isoform X2 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category O
Description protease Do-like
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko01000, ko01002, ko03110
KEGG_ko ko:K08669, ko:K08784
EC 3.4.21.108
KEGG_Pathway ko04210, ko04214, ko04215, ko05012, map04210, map04214, map04215, map05012
GOs -

Sequence

CDS:  
ATGATCCCGCTTCTTAGAAAGGTTTCGAGCTCACGCAACACGCTCGGACGGATCGCTGCAATTGCTGCTGCTGGTTCTTGTTTCTGGTACGCCGGAAGCAAATTAGATAATGGATCCTCTGTAGTGTTGTCAATTCCTGCTGCTTTGAGTGAGCCACTGTTTCTTCCATGGCAGACCGCACACGGCTTCACGATTCATCCTTCTGGTGCATTTGATCACCAGAAATTGGGTCTTTCATTTTGTTCTTCAAGAGTCAGTCCTGCTCCACCATCTGGTGTGGAGAAGGAAAAGCCTGGAGATACGCAGAAGCCTTGTCCAAGATGTTTGGATAGAGATACAATTGCAAATGCTGCAGCAGATGTCGGCCCTGCTGTTGTAAATATTTCTGTTTCACATGGTATTTACGGAATTGCTACTGCTAAAAGCATGGGATCCGGAACAATTATTGACAAGGATGGTACTATTTTAACATGTGCCCATGTCGTGACGGATTTTCATGGTCCACGAGCTGCATCCAAAGGAAAGGTAGAGGTTACTCTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCCATGGCAAAGCTTGGTTCTTCAAGCAAGCTCCGACCAGGGGATTGGGTTGTAGCAATTGGGTGTCCACTTTCGCTTCAGAATACTGTCACAGCTGGTATAGTAAGTTGTGTTGACCGTAAGAGTAGTGATTTGGGTCTTGGTGGAATGCGAAGGGAATATCTACAAACAGATTGTGCAATTAACGTGGGAAATTCTGGGGGTCCTCTTGTTAATGTGGATGGAGAAGTTATTGGTGTAAATATTATGAAAGTGGATGATGCCGTTGGATTAAGTTTCGCTGTACCAATTGATTCAGTCTCCAAAATTACAGAGCAATTCAAGAAAAGAGGGAGAGTTATTCGGCCTTGGCTTGGATTGAAAATGATCGATCTCAACGAAATGATAATCGAACAACTTAAAGAAAGAGATGCATCTTTTCCAGACGTTACTAAAGGGGTTCTTGTAGCCATGGTAACTCCTGGATCCCCTGCTAGTCGTGCTGGGTTCCGTCCTGGTGATGTCGTCATCGAGTTCGATAAGCAACCTGTTGGCAGTATCCAAGAGATCATTGAAATTATGGGAGATAGAGTTGGGATTCCATTGAAGGCAGTTGTGAAAAGATCACTTAATGGCATCATCACTCTGACTGTTCTTCCTGAGGAGTCCAATCCAGATATGTGA
Protein:  
MIPLLRKVSSSRNTLGRIAAIAAAGSCFWYAGSKLDNGSSVVLSIPAALSEPLFLPWQTAHGFTIHPSGAFDHQKLGLSFCSSRVSPAPPSGVEKEKPGDTQKPCPRCLDRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIDKDGTILTCAHVVTDFHGPRAASKGKVEVTLQDGRTFEGTVMNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVIGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNEMIIEQLKERDASFPDVTKGVLVAMVTPGSPASRAGFRPGDVVIEFDKQPVGSIQEIIEIMGDRVGIPLKAVVKRSLNGIITLTVLPEESNPDM
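
The CDS and Protein entries above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), which is an assumption on my part since the record does not state the table used. `translate` is an illustrative helper, not a library function, and the two input strings are short excerpts (the first 21 and last 12 nucleotides of the CDS), not the full sequence.

```python
# Standard genetic code laid out in TCAG order (assumed: NCBI table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# Excerpts from the CDS above: its first 21 nt and its last 12 nt.
cds_start = "ATGATCCCGCTTCTTAGAAAG"
cds_end = "CCAGATATGTGA"
print(translate(cds_start))  # MIPLLRK
print(translate(cds_end))    # PDM
```

Both excerpts match the corresponding ends of the Protein sequence (`MIPLLRK...` and `...PDM`); passing the full CDS string to `translate` would extend the same check to the whole record.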